Scaling and data collapse for the mean exit time of asset prices 
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We study theoretical and empirical aspects of the mean exit time of financial time series. The 
theoretical modeling is done within the framework of continuous time random walk. We empirically 
verify that the mean exit time follows a quadratic scaling law and it has associated a pre-factor 
which is specific to the analyzed stock. We perform a series of statistical tests to determine which 
kind of correlation are responsible for this specificity. The main contribution is associated with the 
autocorrelation property of stock returns. We introduce and solve analytically both a two-state 
and a three-state Markov chain models. The analytical results obtained with the two-state Markov 
chain model allows us to obtain a data collapse of the 20 measured MET profiles in a single master 
curve. 

PACS numbers: 89.65.Gh, 02.50.Ey, 05.40.Jc, 05.45.Tp 



I. INTRODUCTION 

The Continuous Time Random Walk (CTRW) for- 
malism introduced four decades ago by MontroU and 
Weiss [l| has been successfully applied to a wide and 
diverse variety of physical phenomena over t he y ears | 3| 
but only recently to finance |a liL la la ItL lol lift IllL ll^ . 
In this latter context, the efforts have been mostly fo- 
cused on the statistical properties of the waiting time 
between successive transactions and the asset return at 
each transaction. Different studies in different markets 
are conceiving the idea that the empirical distributions 
of both random variables are cornpatible with an asymp- 
totic fat tail behavior 

Within the CTRW formalism some of us have recently 
investigated the mean exit time (MET) of asset prices 
out of a given interval of size L for financial time series 
[T^ . This study shows that the MET follows a quadratic 
growth in terms of L for small interval lengths L. In 
the same study, this functional form was observed for a 
specific time series of the foreign exchange (FX) market, 
which is the U.S. dollar/Deutsche mark futures time se- 
ries (la] . 

In this paper we investigate both theoretically and em- 
pirically the MET of price returns traded in a stock ex- 
change. In our empirical investigation we study the MET 
of high frequency return time series of 20 highly capital- 
ized stocks traded in New York Stock Exchange. Em- 
pirical results about this market confirm that the MET 
follows a power law with a pre-factor that depends on 
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the specific stock chosen. This observation motivates us 
to first verify and then release some of the assumptions 
used in Ref. therefore generalizing the model dis- 
cussed in that paper. The theoretical generalization has 
been performed by introducing and solving a new two- 
state chain Markovian model able to both describe the 
quadratic scaling property of the MET and provide the 
data collapse of the MET stock pre-factor. 



We show that a satisfactory data collapse of the MET 
is obtained when some degree of autocorrelation in the 
stock returns is introduced in the two-state chain Marko- 
vian model. We attempt to further improve the accuracy 
by extending the model to a three-state Markov chain for 
which we are still able to evaluate the MET. Neverthe- 
less, empirical data show that the three-state model does 
not improve the quality of data collapse in the MET pro- 
files of the different stocks although the theoretical curve 
shows a better agreement with the empirical data than 
that of the two-state Markov chain model. 



The paper is organized as follows. In Sections ITU and 
Iml we discuss the MET behavior under the CTRW 
formalism and a series of simplifying assumptions. In 
Sect. lIVl we empirically investigate the scaling and data 
collapse properties of highly capitalized stock data. Sec- 
tion prelates the time correlation of stock return with 
the absence of data collapse of MET observed in the pre- 
vious section. In Sect. IVll we introduce and solve a two- 
state and a three-state Markov chain model to describe 
the empirical MET observations. Conclusions are drawn 
in Sect.ElII 
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II. MEAN EXIT TIME FOR I.I.D. PROCESSES 

In the most common version of the CTRW for- 
mahsm a given random process X(t) shows a se- 
ries of random increments or jumps at random times 
• • • ,t-i,to,ti,t2, ■ ■ ■ ,tn, - ■ ■ remaining constant between 
these jumps. Therefore, after a given time interval 
Tn = tn — tn-i, the proccss experiences a random in- 
crement AX„(t„) = X{tn) — X{tn-i) and the resulting 
trajectory consists of a series of steps as shown in Fig.^ 



Both waiting times r„ and random jumps AX„(t„) are 
assumed to be independent and identically distributed 
(i.i.d.) random variables described by their probability 
density functions (pdfs) which we denote by ^'(t) and 
h(x) respectively. 

However, in the most general representation of the for- 
malism, another function is needed to describe the time 
evolution of X{t). We denote this function by p{x,t) 
which is the joint pdf of waiting times and jumps: 



p{x, T)dxdT — Probja; < AX„ < a; + dx; t < Tn < t + dr}. 

I 



(1) 



X(t) 



x<tj 

X(t_,) 



X(tJ=x„ 



FIG. 1: A sample trajectory of the X{t) process along with 
the corresponding value of the random variable ta,bixo)- 



Note that the functions iP{t) and h{x) are the marginal 
probability density functions of p(x.t). We refer the 
reader to Ref. QlllllllllillllEflinifora more 
complete account of the CTRW formalism. 

In this paper we will apply the CTRW to study some 
aspects of the exit problem of financial time series. We 
will take as underlying random process X{t) the logarith- 
mic price X(tn) = ln(5(i„)), where S{t) is the stock price 
at time t. We specifically consider the problem of ob- 
taining the mean exit time oi X{t) out of a given interval 
[a, 6]. We assume that at certain reference time right 
after an event, the price has a known value X{tf)) — a;o, 
xq G [a.,b]. We focus our attention on a particular real- 
ization of the process and suppose that at certain time 
tn > to the process first leaves the interval (see Fig. 
We call the interval i„ — to the exit time out of the re- 
gion [a, 6] and denote it by ta,bixo)- This quantity is a 
random variable since it depends on the particular trajec- 
tory of X{t) chosen and the MET is simply the average 
Ta,b{^o) = E[to,b(a;o)]- 

The standard approach to exit time problems is based 



on the knowledge of the survival probability which is gen- 
erally quite involved However, within the CTRW 
formalism one can assume that the events compose a 
series of independent and identically distributed two- 
dimensional random variables. Under such an assump- 
tion, some of us ^3 have recently shown that one can 
obtain the MET directly, without making use of the sur- 
vival probability. In this framework the MET obeys the 
following integral equation JJj 



T{xo) = E[t] + / h{x — xo)T{x)dx, 



(2) 



where 



E[r] 



^/jir'ydr' 



is the mean waiting time between jumps. It is worth 
noticing that Eq. ^ is still valid even when t„ and AX„ 
are cross-correlated. In fact, in the case of an i.i.d. pro- 
cess the MET only depends on the pdfs of waiting times 
■0(r) and jumps h{x), but it does not depend on the 
particular form of the joint pdf p{x,t). However if we 
would remove the i.i.d. hypothesis we should specify a 
functional form for p{x,t). 

We now assume that returns increments are distributed 
according to an even pdf, h{x) = h{-~x), which also sat- 
isfies the following scaling condition, 



h{x) ^-h(- 



(3) 



where k is the scale of the fluctuations given by the 
standard deviation k of jumps, where — E[AAr^ — 
i?[AA"„]^]. The parameter k corresponds to the 
transaction-to-transaction volatility. Under these as- 
sumptions some of us showed that the MET out of a 
small region of size L = b — a ^ k is 12] 



T{a + L/2) = E[r] 



1 + 2H{0) 



(i/'(0+)+4i7(0f) - 



2k 



L 
2k 

2 



(4) 
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where it is assumed that the return process is initially in 
the middle of the interval [a,b]. 

In Ref. 10 some of us have applied the above result 
to the FX market of the U.S. doUar/Dcutsche mark fu- 
ture price. One important conclusion there was that the 
quadratic growth of the MET is still a good approxi- 
mation even for large intervals, i.e. L ^ k. It is less 
clear nonetheless that the coefficients of the polynomial 
which we have obtained in Eq. Q are those that better 
reproduce the global behavior of the MET. We refer the 
reader to Rcf. 12] for a more detailed discussion on this 
particular aspect of the problem. 

In this paper we analyze the scaling properties of the 
MET for 20 highly capitahzed stocks traded at the NYSE 
in the four year period 1995-1998 and spanning 1,011 
trading days. Table Q shows the list of stocks and the 
relevant parameters. We have measured the MET for 
each stock and compared them in the scaled variables 
T{a + L/2)/E[t] and L/2k. If all previous hypotheses of 
the model were correct, and the function H{u) is of uni- 
versal nature, one should observe the same curve for all 
stocks, i.e. a data collapse, as well as a quadratic growth 
in L at least for small L. However, Fig. [21 shows that 
there is a considerable spread of the curves, although 
the parabolic shape is recovered in all cases not only for 
small intervals, as expected, but for the whole investi- 
gated range of L/2k . 

TABLE L Summary statistics of the 20 stocks we study for 
the period 1995-1998. The second column gives the number 
of transactions, the third one the standard deviation of jumps 
and the fourth one the mean waiting time, i.e. the mean time 
between two intraday consecutive transactions. 



Ticker 


Transactions 


K (xlO-*) 


E[r] (seconds) 


AHP 


521,639 


9.18 


44.2 


AIG 


472,393 


6.98 


49.2 


BMY 


573,397 


7.30 


40.3 


CHV 


449,328 


8.47 


51.5 


DD 


645,164 


8.80 


36.0 


GE 


1,319,145 


7.17 


18.7 


GTE 


512,581 


13.10 


45.2 


HWP 


930,003 


9.20 


25.0 


IBM 


1,072,395 


5.42 


22.7 


JNJ 


728,686 


8.75 


32.1 


KO 


784,357 


9.11 


29.7 


MO 


971,700 


10.11 


23.9 


MOB 


461,669 


6.82 


50.0 


MRK 


971,842 


7.34 


23.8 


PEP 


767,929 


15.48 


30.3 


PFE 


1,003,518 


7.46 


23.1 


PG 


679,601 


7.36 


34.2 


T 


1,030,761 


11.13 


22.6 


WMT 


565,946 


15.80 


40.1 


XON 


674,412 


7.08 


34.5 




L/2k 



FIG. 2: (Color online) Mean exit times as a function of the 
region size L for the 20 investigated stocks. The x axis is 
scaled by 2k, whereas the y axis is scaled by the mean wait- 
ing time E[r]. The inset shows the same curves in a double 
logarithmic plot. 



III. A DISCRETE STATE MODEL 

Our first objective is to understand why the quadratic 
term governs both the long and short range behavior of 
the MET, without a drastic change of the general model 
and its assumptions. In the present approach we develop 
a model for h{x) based on the small-scale properties of 
the system. 

It is worth noticing that this approach is also used in 
the context of option pricing, when the fair price of a 
derivative product is obtained by making use of the bi- 
nomial trees methodology, where it is assumed that the 
stock price makes a jump up or down with some proba- 
bility 15j. Here we introduce the following symmetrical 
three-state discrete model: 

h{x) = Q Six) + [Six - c) + Six + c)] . (5) 

where Q represents the probability that the price remains 
unchanged, and c is the basic jump size jig . By substi- 
tuting this expression of hix) into Eq. (0) we obtain 

Tixo) ^ E[r] + g Tixo) + ^-^ [Tixo + c) + r(xo - c)] , 

with the convention that the term TixQ + c) only counts 
if xq < b — c, and similarly that T(a;o — c) only appears 
when xo > a + c. Let us analyze these two boundary 
conditions in greater detail. In general, the limits of our 
interval can be expressed in the following form: 

a ^ xq - il + Sa)c, 
b = Xo + im + £b)c. 
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with l,m G N and ea,£b & [0,1)- However it is easy to 
conclude that T{xo) can depend neither on nor on Si,. 
The only way of leaving the interval is by reaching the 
points X — xo — {I + l)c or X = xo + (m + l)c, because 
X = xo — Ic, and x — xq -\- mc lay always inside the 
interval. Therefore we will not loose generality by setting 
£a = £& = 0. After that, the length of the interval in the 
natural scale units of the problem is N = L / c = I + m, 
and the use of the following notation T„ = T{a + nc), 
with n G {0, 1, ■ ■ ■ , N — 1, iV}, arises in a natural way. 

Summing up, for the discrete model given by Eq. Q 
the MET out of the interval [a, b] obeys the following set 
of difference equations 



E[t] 
1-C 



2 (^n+l 



(n = 0, 1, 2, • • • , N) with boundary conditions: 

r_i = Tn+1 = 0. 

The solution to Eqs. ©-O is given by 

T„ = E[f](n + l)(iV + l-n), 



(6) 

(7) 
(8) 



where the random variable f is related to r in such a way 
that 



E[f] 



E[r] 

1-Q' 



(9) 



By repeating the above derivation leading to Eq. ((HJ, 
one can show that the random variable f represents the 
waiting time between jumps if one neglects zero-return 
trades, i.e. if one identifies the occurrence of a jump when 
X{t) actually changes its value. Thus, for instance, if 
AXi_i 0, AXi — 0, and AXi+i ^ with correspond- 
ing waiting times r^-i, t^, and Ti+i, we can replace the 
pair of events (AX^, AX^+i) with a single transaction of 
size AXj — AXi + AX^+i = AX^+i, taking a waiting 
time Tj = Ti + Ti_|_i. 

From Eq. (|SJ) we see that, for even values of N, the 
MET starting from the middle of the interval reads 



N/2 



m ( 1 + y 



(10) 



Looking at the general solution given by Eq. Q and also 
at Eq. H10() we clearly observe a quadratic behavior of 
the MET as a function of N, that is to say, as a function 
of the length of the interval. Indeed, from Eq. (|10|1 we 
have: 



Tia + L/2) _ / L_ 



(11) 



where N = L/c and = E[AXl - E[AXn?] = c^. The 
same kind of scaling also holds in terms of the parameters 
of the three-state model. In fact, from Eq. ^ we get 



T{a + L/2) 
EH 



1 



1-Q 



1 



2k 



(12) 



where = E[AX^ - 
large values of L/k, 



E[AXn]^] = (1 - Q) c2. Hence, for 



IV. 



T{a + L/2) / L 
E[7] V2^ 



CAUSES OF ABSENCE OF DATA 
COLLAPSE 



(13) 



In the previous section we have shown that a sim- 
ple discrete model for the jump distribution results in 
a quadratic growth of the MET valid for arbitrary values 
of the length of the interval and not only for small values 
of L/k, as was the case of Eq. Q. 

Unfortunately the discrete model does not properly ac- 
count for the spread of the MET curves observed in Fig.|21 
when we consider different stocks. One could argue that 
this spread can be controlled through the parameter Q 
appearing in Eq. H12|) since Q may distinguish one stock 
from another. However, as Eq. (|13|l shows the MET is 
practically independent of Q for large values of L and the 
difference between stocks would disappear in this range 
of lengths. Nevertheless, we clearly see in Fig.|21that the 
spread between stocks does not tend to vanish but, even 
in some cases, it increases with L. 

In this Section we try to identify the possible reasons of 
the failure of the data collapse of our previous model. We 
revisit some of our assumptions and derive consequences 
with the aim of finding the most important feature that 
we are leaving aside. The final goal is to improve our de- 
scription in the simplest possible way. Let us first sum- 
marize some potential causes for the lack of data collapse 
in the previous models: 

1. The probability density h(x) is different for differ- 
ent stocks. This would imply different mean exit 
time curves. 

2. There is some dependency on the cross-correlation 
between waiting times and price returns. 

3. The time auto-correlation of waiting times should 
be included in the model. 

4. The time auto-correlation of returns should be in- 
cluded in the model. 

We analyze the impact of these hypotheses on the 
empirical outcomes by performing shuffling experiments. 
Thus, in order to test the first hypothesis we shuffle inde- 
pendently the time series of AXn and t„ and we perform 
the mean exit time analysis on the shuffled time series. 
This shuffling destroys all the time- and cross- correla- 
tions but it preserves the shape of the pdfs h{x) and 
'0(r) and therefore the values of the scaling parameter k 
and the average waiting time E[t]. Figure |31 shows a very 
good collapse indicating that the assumption of a master 
pdf for the returns of all the stocks is good working hy- 
pothesis and it is not the reason for the lack of collapse 
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of Fig. 121 Moreover the MET curves are well fitted by 
the functional form y — {1 + x)'^ . 



2000 




FIG. 3: (Color online) Mean exit times as a function of the 
region size L for the 20 investigated time series after shuffling 
both waiting times and return increments. The x axis is scaled 
by 2k, whereas the y axis is scaled by the mean waiting time 
E[r]. The inset shows the same curves in a double logarithmic 
plot. The dashed curve represents the functional form y — 
{l + xf. 
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FIG. 4: (Color online) Mean exit times as a function of the 
region size L for General Electric obtained by performing four 
different shuffling experiments. The dashed (black) curve is 
the MET of the original data. The solid (red) line is the 
MET for data shuffled both in the waiting times and in the 
return increments. The squares (green) give the MET when 
one shuffles both AX„ and r„ but preserving the association 
(cross correlation) between the two and circles (blue) give 
the MET when one shuffles only the waiting times. The star 
(magenta) give the MET when one shuffles only the returns. 



The other three hypotheses can be similarly tested by 
performing three different shuffling experiments. Specifi- 
cally, hypothesis 2) can be tested by shuffling simultane- 
ously the two series and preserving the cross-correlation 
between AX„ and r„. Notice that even if Ref. |0| has 
shown that in the absence of autocorrelation of waiting 
times and returns (i.i.d. model) the MET is indepen- 
dent of the cross-correlations p{x, r), in the general case 
p{x,t) may play a role. Hypothesis 3) can be tested by 
shuffling only the series of waiting times and preserving 
the order in the series of returns. Finally, hypothesis 4) 
can be tested by shuffling only the time series of returns 
and preserving the order of the series of waiting times. 
Figure 01 shows the results of these shuffling experiments 
for the General Electric (GE) stock. The Figure shows 
that neglecting the autocorrelation function of waiting 
times does not change the MET curve. On the other 
hand, when one destroys the auto correlation of returns 
the MET (star) changes dramatically and becomes close 
to the one predicted by the i.i.d. model. 

By summarizing, in order to have a good model of 
the mean exit time one cannot neglect the correlation 
properties of price return, whereas the other correlations 
can be neglected as a first approximation. 

V. THE CORRELATION OF RETURN 
INCREMENTS 

What is the origin of correlation of returns? There 
are, in principle, two possible answers. One contribution 



comes from the linear autocorrelation of the increments 
of returns given by E[AX„AX,„]. A second contribution 
is related to nonlinear properties which can be exempli- 
fied by the nonlinear correlation E[|AX„||AArm|]. 

The first contribution can be easily evaluated by tak- 
ing the linear autocorrelation function of the increments 
of price returns. Figure |31 shows this quantity for GE 
(solid line). In the Figure it is clear that for a lag of one 
trade the linear autocorrelation function is negative and 
significantly different from zero. This is a known effect 
of transaction prices which is due to the presence of a 
spread between the best bid and the best ask (the "bid- 
ask bounce", see for example It is reasonable to 
assume that this short range correlation can be included 
in the model by using a Markov process. 

The nonlinear correlation is related to the volatility 
and can be quantified by plotting the autocorrelation 
function of |AAr„|. The result (dashed line) in Fig. [S] is a 
slowly decaying function indicating a long range correla- 
tion, which is probably not compatible with a Markovian 
model. 

We can test the relative importance of the two contri- 
butions to the correlation of returns by performing an- 
other shuffling experiment. We can obtain a surrogate 
time series with the same linear autocorrelation function 
but with an uncorrelated volatility (absolute value of re- 
turn increments). The method (see for example Chapter 
7 of Ref. |3| ) consists in taking the Fourier transform of 
the original time series and then randomize its phases. 
Because of the Wiener-Kinchine theorem the linear au- 
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-0.2 - 



lag (number of transactions) 

FIG. 5: (Color online) The solid (blue) line shows the auto- 
correlation function of the time series of returns for the stock 
GE. The dashed (red) line shows the autocorrelation of the 
absolute value of returns for the same stock. 



tocorrelation of the surrogate time series is the same as 
the original, but the nonlinear correlation will be zero. 
Therefore, we have a time series with the same bid-ask 
bounce properties but with an uncorrelated volatility. 
One difficulty of the method is the fact that the pdf of 
the surrogate series will be in general different from the 
original one (unless the time series is Gaussian). Since 
we know that, in general, the MET depends on the re- 
turn pdf, we should control that the distortion of the pdf 
introduced by the phase randomization is not critical in 
changing the properties of the MET. To this end we first 
compare the MET curve of the shuffled original time se- 
ries to the shuffled phase-randomized time series. These 
two series are both i.i.d. but with different pdfs. FigureEl 
shows that the two METs (dashed line and squares) are 
very close, thus indicating that the distortion of the pdf 
introduced by the phase randomization is not critical for 
the MET properties. 

We can now compare the METs of the original data to 
the ones for the phase-randomized time series. The two 
series have the same linear autocorrelation function, but 
the first one displays a clustered volatility whereas the 



In this Markovian case the conditional MET, 
T(a;o|AXo, To), will also depend on both the mag- 
nitude of the previous jump AXq = xq — x^i and its 
sojourn time tq — to — t^i (see Fig. Now, and 



randomized series does not. The two METs (see solid line 
and circles in Fig. ^ are again very close, thus showing 
that the most important time correlation contribution to 
the MET is the linear autocorrelation (bid- ask bounce), 
while the clustered volatility plays a minor role. 



VI. MEAN EXIT TIME FOR A 
MARKOV-CHAIN MODEL 

In order to incorporate the linear correlations of re- 
turns, we derive an integral equation for the mean exit 
time out of a given interval when the driving process is a 




FIG. 6; (Color online) Mean exit times obtained by per- 
forming shuffling experiments for the stock GE. The dashed 
(green) line refers to the real data shuffled both in time and 
in returns (the curve is also shown in Fig. |^ . The squares 
(blue) refer to the shuffled phase-randomized time series. The 
solid (black) curve refers to the original data (the curve is 
also shown in Fig. |2l and the circles (red) refer to the phase- 
randomized time series. 



continuous time random walk with memory. In particu- 
lar, wc will consider Markov processes within the CTRW 
framework. Such models lead to the following joint con- 
ditional probability density function: 



(14) 

I 

contrary to Eq. ij^J, the integral equation for the condi- 
tional MET depends on the complete joint probability 
density function. It reads: 



J 



p{x, t\x' , T')dxdT — Prob{x < AXn < x + dx; t < Tn < t + c?t| AX„_i = x'; t„_i — r'}. 
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poo nb 

T(a:o|AXo,ro) =E[t|AXo,to] + / dr p{x - xo,T\AXo,To)T{x\AX,T)dx, 

Jo J a 



(15) 



r 



where AX — x — xo and r = t — to. The level of com- 
plexity of Eq. (|15|l can be considerably reduced by noting 
that, as we have shown in Sect. it is possible to remove 
the correlation between consecutive waiting times with- 
out affecting the MET. Therefore, we will assume that 



J 



the correlation involving waiting times is negligible, and 
that all relevant information we have to consider when 
dealing with the n-th event is the magnitude of the pre- 
vious change. In such a case instead of Eq. (|14|l we write 



p{x, T\x')dxdT = Prob{x < AX„ < x + dx; t < Tn < t + drj AX„_i — x'}. 

I 



(16) 



Hence, the integral equation for the MET is simpler, be- 
cause on the right hand side of Eq. p5|l we can perform 
the integral over time. We thus obtain 

T{xo\AXo) = E[t|AXo] 

+ f h{x-xo\AXo)T{x\AX)dx. {17) 

J a 

In this case the MET only depends on the marginal 
probability density function of the return increments, 
hix\AXo), 



hix\AXo) 



p{x, T\AXo)dT, 



and on the conditional expectation of waiting times 
E[t|AXo] which has to be evaluated through the 
marginal pdf, ij{T\AXo), 

/oo 
p{x,T\AX„)dx. 
-oo 

We finally observe that although xq G [a, 5], we let AXq 
to be any real number. 



A. A two-state Markov chain model 

In order to solve Eq. H17|l and obtain explicit expres- 
sions for the MET that can be compared with empirical 
data, we follow the same approach of Sect. Illll and choose 
a discrete model for h{x\AXQ). At this point we can opt 
for a two-state model in which, at any time step, returns 
can only go up and down a fixed quantity c, or for a three- 
state model where in addition the return increment can 
be zero. We have shown in Sect. Illll that for an i.i.d. 
process both alternatives are equivalent. In the case of a 
Markovian process the equivalence is not complete. As 
we will see below, the final expressions obtained for the 



unconditional MET are slightly different although, for 
large values of i, the leading term is the same in both 
cases. 

Let us start with a two-state Markov chain model. In 
the symmetrical case in which up and down movements 
are equally likely, the conditional pdf for return incre- 
ments is 



h{x\y) 



ry 



5{x^ 



c — ry 



5{x + c), 



(18) 



2c ' ' 2c 
where r is the correlation between the magnitude of two 
consecutive jumps: 



Cov[AX„, AX„ 



(19) 



/Var[AX„]Var[AA„_i] 
From Eq. (|18() we see that the squared volatility, 

/>OC 

K^{y) = / x'^h{x\y)dx — c^, 



is independent of y. By substituting Eq. (|18|l into 
Eq. (|17|l we get the following difference equation for the 
MET: 



T(xo|Alo) 



E[t\AXo] 

c - r AAq 
2^ 



c -t- rAAo 



2c 

T{xq - c| - c) 



T{xo + c\c) 



(20) 



where AXq — ±c, T{xq + c|c) = if > 6 — c and 
r(xo — c| — c) = if Xo < a -I- c. We extend the notation 
introduced in Sect. Illll and define 

T'„,„=pi = r(a;o = a + nc|AAo ±c). (21) 

Now, Eq. (|2()|l is equivalent to the following set of recur- 
rence equations: 



1 + r 1 

Tn,n-1 — E[f] H — Tn+l^n H 

1 - r 1 

Tn,n+1 = E[f] H — Tn+l^n H 



-Tn-l.ni (22) 



(23) 



(n = 0, 1, • • • , N) with boundary conditions: 

r_i,o = Tn+i.n - 0. (24) 

Note that in writing Eqs. we have set E[f |±c] = 

E[f] which is consistent with the assumed symmetry be- 
tween up and down movements (see also Eq. I|25|) below). 
The solution to problem if^ -ll^ reads 

.1 = E[f]{N- - ^ - ^ 



1 - n) 



1 



T„,„+i = E[f](n+1) 



1 + (TV - n) 



1 + r 
1 - 



1 



The quantity of interest for our analysis is the uncon- 
ditional MET T„, which is related to T„^„±i by 



that is, 



Tn = E[f] 



N 1-r , 
1+ + n{N-n) 

1 + r 1 + r 



(25) 
(26) 



Thus the MET starting from the middle of the interval 
reads 

2' 



T, 



N/2 



m 



2r 



1 



N\ \-r N 



In this case, and contrary to the i.i.d. case given in Eq. 
(jTUIl . the MET is not a perfect quadratic expression. 
However, taking into account that N = L/c and k = c 
we have 

2 



T{a + L/2) 
E[f] 



2r 



1 



L 

^ 2k 



1 



. IH 

1 + r V 2k 



(28) 

and for large values of L/ii we recover the expected 
quadratic behavior in the leading term: 

2 



T{a + L/2) 

m 



1 



1 



1 



L 
2k 



(29) 



Note that the value of r depends on each particular stock. 
Therefore, the scaled MET defined as: 



Tsc{L) = 



_ (l + r\ T{a + L/2) 



1 



m 



(30) 



tends, for increasing values of i, to a quadratic function 
of the interval length which is independent of the partic- 
ular stock chosen. 

In Fig. [7| we compare the MET curves for the data 
scaled in the original way (inset), as given in Eq. 
and for the data scaled according to Eq. H3U|) . The inset 
of Figure [T] is essentially the same as Fig. |21 Figure [3 
shows that the scaling of Eq. (|29|l gives a significant im- 
provement with respect to the original one. We note 
that the scaled curves are systematically below the curve 
y — {1 + x)"^, probably because some correlation is not 
taken correctly into account by the two-state model. We 
will now attempt to correct this bias by introducing a 
three-state model. 




L/2k 



FIG. 7: (Color online) Comparison of the MET curves for the 
original i.i.d. model and the two-state Markov chain model. 
In this figure we have scaled the MET according to Eq. (I29II 
of the two-state Markovian model. The dashed (black) line is 
the parabolic curve y = (1-l-a:)'^. The inset is the same shown 
in Fig. |21for the i.i.d. process. 



B. A three-state Markov chain model 

One can argue that the two-state model just devel- 
oped would need an improvement in order to include 
zero-return transactions, i.e. those with AX„ = 0. In 
Sect, mil we have shown that for the i.i.d. process the in- 
clusion of a third possible state is completely equivalent 
to a two-state (up and down) model after redefining the 
mean waiting time and the volatility k by including the 
probability Q of zero-return transactions. However, when 
memory is present, as is now the case, this equivalence 
is not complete. We thus outline a discrete three-state 
Markov chain model. 

The Markov-chain model is now characterized by the 
following transition matrix: 



/ p(_|o) P(- 

P(0|-) P(0|0) P(( 

\pi+\-) pi+\o) Pi- 



rn 



1+) 



where P(-|-) = Prob{AX„ = -c\AXn-i = -c} and 
similar definitions for the rest of the matrix elements. 
Since we are also assuming that the process is symmetri- 
cal for positive and negative returns, the transition ma- 
trix T can be written in the following form: 



l-q 1-r „ 



1 — p q 1 — p 



where q = P(0|0) is the probability for trapping, 
p = P(-h) + P(+|-) = P(+|+) + P(-|+) 



(32) 
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and r measures the strength of the persistence: 

P(-|-)-P(+|-) p(+n-)_p(_|+) 



p( 



Pi 



P{-\- 



Note that the first order autocorrelation coefficient, de- 
fined as on the right-hand side of Eq. H19|l is now given 
by (c/. Eq. m) 



In writing Eqs. H36|) and 1)38(1 we have taken into account 
the symmetry 

E[r| +c] ==E[r| -c] =E[r|c]. 

Finally the solution to Eqs. ((36(1 - 1(38(1 with boundary 
conditions 



Cov[AX„,AX„_i] 
v/Var[AX„]Var[AX„. 



T-ifi = Tn+i,n — 



pr. 



(33) 



We also need to specify the unconditional probabilities 
of each state: 



(34) 



However, in this case Q is not an independent parameter 
as was the case of the i.i.d. model (c/ Sect. IIIIIl . In- 
deed, using the total probability formula and taking into 
account the values of the transition matrix T and the 
vector P, we get 



(c/Eq. ^) reads 
T 

Jn.n-1 - 



(TV + 1 - n) 



l + n 



1 — pr 



1 + pr 



(39) 




T — 

n,n " 



E[r|0] E[r] 



l-q 2(1 -Q) 



1 — pr 
1 -f pr 



[N -n){n + l) 



+ n{N - n + 1) +N 



p = i 



1 -Q 



E[r] 



(n + l) 



1 + (iV - n) 



1 — pr 



1 + pr 



(40) 



(41) 



Observe that when q = Q we have p = 1 — Q a-nd there 
is no trapping in the value of the random process. 
Now the pdf of the returns reads 



h{x\y) = aiy)dix) 



where 



2c 



a(2/) 



(35) 

[(c -I- ry) 6{x ~ c) + (c— ry) S{x + c)] , 



where E[r] is the (unconditional) mean waiting time 
which is related to E[r|c] and E[r|0] by 

E[t] =QE[T|0] + (l-Q)E[r|c]. 

In terms of T„^„±i and T„_„ the unconditional MET Tn 
is given by 



l-Q 



q, if y = 0, 

1-p, ify^O. 



and starting from the center of the interval we explicitly 
have 



In this case the integral equation ((17(1 is equivalent to the 
following set of difference equations for T„.„ and Tn^n±i- 



N/2 



E[t] 
l-Q 
E[r\0] 



Tn.n-l = E[T|c] + (l-p)r„ 



Q 



l+pr\ 2 



E\t] 



1 — pr 
1 + pr 



1 + ^ 



1 



l-Q 



(42) 



1 + r 



1 - r. 



(36) 



r„,„=E[T|0] -f qTn^n 
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Tn+l,n + Tn-l^n] (37) 



The main difference between this expression andEq. ^ 
is the final constant term that accounts for the two kind 
of trapping that the system may experience: the proba- 
bilistic one, q Q, and the temporal one, i?[r|0] ^ E[t]- 
By using the fact that = (1 — Q) c^, the leading term 
is again of the form: 



T„,„+i = E[r|c] + (l-p)r, 
+ P 



T{a + L/2) l-pr ( L 
E|r] 1 +pr \2k 



(43) 



1 — r 1 + r 



2 --r.,.. 2 

where Tn^n±i are defined as in Eq. I(21() and 

T„^n = T{xq = a + nc|AXo = 0). 



(38) 



In Fig.|Slwe show the MET curves for the data scaled ac- 
cording to Eq. 1(431) . It is worth noting that the quantity 
p r appearing in Eq. 1(43(1 can be estimated in two differ- 
ent ways. One could separately compute p, the proba- 
bility of a change in the price return provided a previous 
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change, and r which is the strength of the persistence. 
Ahernatively one can estimate directly the quantity p r 
by using Eq. . This second approach has the advan- 
tage of reducing the dependence of the estimates from 
the specific details of the model. This is the reason why 
in the curves shown in Fig. |H1 the quantity p r has been 
estimated by using Eq. H33|) . The scaling shown in Fig- 
ure |S1 is not satisfactory, we would even say that it is less 
satisfactory than the scaling corresponding to the two- 
state model which has been shown in Fig.[7| However, it 
is worth noting that the rescaled curves are not system- 
atically below the parabolic curve (1 -I- xY as it was in 
the case of the two-state model. 




FIG. 8: (Color online) MET curves scaled according to Eq. 
H4i-{|l of the three-state Markovian model. The quantity pr 
has been estimated by using Eq. (Onj- The dashed (black) 
line represents the curve y = {1 + x)^. 



both a two-state and a three-state Markov chain mod- 
els. The analytical results obtained through the two-state 
model allow us to get a quite satisfactory data collapse 
of the 20 MET profiles into a single parabolic curve as 
predicted by the model. However, this parabolic curve 
appears to be systematically above real data, that is, the 
model overstimates the mean exit time. 

We have been able to solve a three-state Markov chain 
model as well. Unfortunately this more detailed model 
does not provide an improvement on data collapse. The 
main advantage of this generalization is that the MET 
provided by the model lies close to the empirical curves. 
In other words, the three-state model does not overesti- 
mate the METs. 

We do not have a convincing explanation for this ob- 
servation but only some indications. Specifically, we have 
seen that the symmetries assumed in the three state 
model are not present in some empirical transition matri- 
ces 19J. Perhaps this assumption prevents the data from 
a convincing data collapse and the system would perform 
a better collapse in a model taking into account a certain 
degree of asymmetry in the Markovian transition matrix. 
However, obtaining the analytical solution for this more 
general case seems to be very involved and it has been 
left for future research. 

In conclusion, the MET and the search of a data col- 
lapse in the MET curves of stock prices provide a good 
occasion for testing the underlying hypothesis character- 
izing the return dynamics and for the improvement of 
the CTRW models describing this phenomenon. We be- 
lieve we have detected the essential ingredients to be ac- 
counted for a feasible model within the CTRW. We hope 
that the results may provide a certain insight on the way 
to bridge the gap between the description of the stochas- 
tic dynamics at v ery short time horizons with that of 
longer time scales [ll IS El El il H . 



VII. CONCLUSIONS 

This paper presents theoretical and empirical results 
about the MET of financial time series. Specifically the 
scaling property of the MET as a function of the size L 
has been confirmed to follow a quadratic law for a number 
of stock price time series. We empirically verify that the 
quadratic scaling law has associated a pre-factor which 
is specific to the analyzed stock. We have performed a 
series of tests to determine which kind of correlation are 
responsible for this dependence. It turned out that the 
main contribution is associated with the linear autocor- 
relation property of stock returns. 

We have therefore introduced and solved analytically 
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